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METHODS FOR DETERMINING COAT COLOUR GENOTYPES IN PIGS 

The present invention relates to methods for determining coat colour genotype 
in pigs. In particular, it relates to methods of distinguishing between the alleles 
5 •/,/*, 7* and i of the TOT gene. 

Coat colour is important to the pig breeding industry for a number of reasons. 
Firstly in a number of markets there is a preference for white skinned meat 
This is due to the fact that pork is often marketed with the skin still attached, 
10 and skins from coloured pigs, even if dehaired, can still exhibit coloured hair 
roots, which can lead to negative perception by the consumer, since the surface 
of the meat may appear to be spotted by mould. It is therefore necessary in 
these markets to remove the skin from such carcasses, entailing additional cost. 

15 For example, in the US, coloured carcasses are associated with approximately 
1% of skin defects requiring dehairing and skinning to remove pigment. As a 
result of this, coloured pig carcasses are generally discounted. Secondly gross 
variation in the appearance of pigs claimed to be genetically consistent for other 
traits can lead to questions about the consistency and quality of the animals in 

20 the mind of pig-producing customers. Breeders would also like to be able to 
ensure consistency in breeding populations. Thus breeders may wish to ensure 
that progeny produced by breeding crosses were always white. Alternatively a 
breeder producing a coloured breed may wish to ensure that the correct coat 
colour characteristics were maintained even during the introgression of genes 

25 from white lines. 
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White coat colour in pigs is controlled by the dominant white locus designated / 
(for inhibition of coat colour). Structural alterations in the porcine KIT gene 
have recently been correlated with various alleles of I and are probably 
responsible for the differences in coat colour pattern found or are closely linked 
to the mutations that are found (Johansson Moller et al 1996 Mammalian 
Genome 7, 822-830). Four structural versions of the porcine KIT gene have 
been identified to date and designated 7, F, I* and i (see figure 1 and table 1). 
The version found in fully coloured animals including wild boar and which is 
therefore generally accepted as the wild type allele is /. The other versions of 
the gene known all involve duplication of at least part of the porcine £77* gene. 
F is a partially dominant allele and causes in the heterozygous state (F/i) a 
phenotype "patch" characterised by patches of white and coloured coat. I and /* 
are both fully dominant and cause a white phenotype both in the heterozygous 
and homozygous state. The only difference between I and /* is that there is a 
4bp deletion in intron 18 in one of the two KIT gene copies associated with the 
I allele. This sequence polymorphism is not expected to have any functional 
effect. 

The phenotypes of a number of gene combinations are listed below with. the I 
allele being dominant over F and /, and F being dominant over i (Johansson et 
al. 1992 Genomics 14, 965-969) 

Genotype Colour 

/// White 

I/F White 

/// White 
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F/i 



Patched 



Hi 



Coloured 



I*li 



White 



5 Previously Moller Johansson et al. (1996 Mammalian Genome 7, 822-830) 

revealed that KIT occurs as a single copy gene in coloured (i/i) animals but is 
duplicated in the /, / and F allele. This duplication allowed the differentiation 
of/, f and F from i through examination of the gene copy number of KIT and 
to use linked polymorphisms in or in the near vicinity of the KIT gene to 
10 distinguish different alleles at the / locus. This approach is the subject of 
International patent WO97/05278. 

A problem remaining from this method however is that it does not distinguish 
between 7* and F. The result of this is that if one wishes to use the screen to 

15 remove heterozygous carrier animals of genotype I/F from a white pig line one 
also excludes ///* animals unnecessarily. The removal of animals, potentially 
very valuable if at the top of a breeding pyramid, unnecessarily can lead to a 
reduction in the rate of improvement of other traits within that particular group 
of animals. Other consequences include a general loss of genetic diversity and a 

20 loss of alleles at the locus in question that may have as yet undetected value. It 
is essential that one only excludes alleles where absolutely necessary. 

Further sequence analysis of genomic DNA revealed a mutation at the first 
nucleotide in intron 17 of KIT2 leading to a defect in the splicing of RNA 
25 transcribed from this particular copy of the KIT gene. In experiments involving 
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several pig breeds, this mutation showed a complete concordance with the 
presence of the / and I* alleles but was not found in the i and F alleles 

The majority of eukaryotic genes are composed of both coding (exon) and 
5 intervening noncoding (intron) regions. The latter of these sequences are 
removed from large pre-mRNA by a highly accurate cleavage and ligation 
reaction known as splicing. The result is a mature mRNA transcript, devoid of 
intron sequences, which is transported to the cytoplasm for translation. The 
splicing of eukaryotic genes is most likely a two step procedure. First, the pre- 

10 mRNA is cleaved at the 5' (donor) splice site followed by cleavage at the 3' 
(acceptor) splice site. The second step involves rejoining of the spliced exons to 
result in exclusion of the intervening intron. Splicing is therefore critically 
dependant on the accuracy of the cleavage and ligation reactions. This accuracy 
appears to be dependant on the almost completely invariant GT and AG 

15 dinucleotides present at the 5' and 3' exon/intron boundaries respectively. 
These dinucleotides and the often highly conserved surrounding sequences are 
known as splice sites and serve to bind the protein factors required to perform 
the cleavage and ligation reactions. 

20 The consequences for mutation occurring within a splice site may be a 
reduction in the amount of mature mRNA produced and/or ultilization of 
alternative but incorrect splice sites in the vicinity. The result is production of 
mRNA which either contains additional intron sequence or which may lack a 
portion of coding sequence. Where mutation occurs within the 5' (donor) splice 

25 site and prevents binding of protein factors, the exon is no longer recognised as 
such and is excised along with its neighbouring introns. This is referred to as 
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'exon skipping' (as reviewed by Cooper and Krawczak, 1994 Human Gene 
Mutation. Bios Scientific Publishers, Oxford, UK, 1994). 

In the mutation identified here the G of the conserved GT pair at the 
5 exonl7/intronl7 boundary region is altered to an A. That there is an alteration 
in the messenger RNA that is translated into protein and that the presence of the 
change correlates with the white/patched phenotypes suggests that this is the 
functional mutation rather than merely a linked marker. The splice variant is 
expected to give rise to a defective protein as 41 amino acids in the mature 
10 protein are missing. We therefore assume that this splice mutation is the causal 
mutation for the difference between / and f. Our current knowledge as regards 
molecular differences between alleles at the / locus is summarised in Table 1 . In 
conclusion, we have identified two functionally important mutations. One is the 
gene duplication present in f t I* and / which by itself appears to cause the 
15 patch phenotype. The exact reason for this phenotypic effect is not known but 
one can speculate on the basis of comparative data from the mouse that the 
duplicated copy of the KIT gene may lead to a defect in gene expression which 
in turn affects melanocyte migration. This is especially valuable in that there 
can be no breakdown of the linkage between the DNA polymorphism and the 
20 trait itself. This has allowed us to develop a range of assays for the 
determination of the presence of the DNA polymorphism and the genotype of 
the animal with regard to coat colour determination to a significantly greater 
extent than has been possible before. The second mutation is the splice mutation 
that occurs in one of the KIT copies associated with the dominant white (/ and 
25 I*) alleles. The expression of a truncated form of the KIT protein is expected to 
cause a more severe defect in KIT function and a more severe effect on coat 
colour. 
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Table 1. 



5 



10 



Allele 


KIT gene 


Intron 17 


Associated Phenotype 


i 


KIT1 


Normal 


Coloured 


/ 


KIT1 


Normal 


Patch 




KIT2 


Normal 




/ 


KIT1 


Normal 


Dominant White 




KIT2 


Mutated 






KIT1 


Normal 


Dominant White 




KIT2 


Mutated 





In addition to these two functionally important mutations, a mutation in the KIT 
20 gene with no known phenotypic effect, the 4 bp deletion in intron 18 has been 
documented at the J locus as described in International patent application No. 
PCT/GB96/01794. 

Thus, in a first aspect the present invention provides a method for determining 
coat colour genotype in a pig which comprises: 
25 (a) obtaining a sample of pig nucleic acid; and 

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation 
is/is not present at one or more exon/intron splice sites of the KIT gene. 

In particular, the method determines whether a mutation is/is not present at the 
30 exon 17/intron 17 boundary, eg the substitution of the G of the conserved GT 
pair for A. 
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Reverse Transcriptase based Polymerase Chain analysis (RT-PCR) of the exon 
16-19 region of KIT mRNA in animals of the Large White breed {III) and 
comparison to that transcribed in Hampshire animals (i/z) revealed an extra 
species of molecule in the former animals. RT-PCR analysis of both breeds 
yielded a product fragment with a length of 424bp indicating that both types of 
animal contained a ATT mRNA transcript containing a region corresponding to 
this region of the gene. However in addition the HI animals yielded a RT-PCR 
product of 301bp indicating the presence of an mRNA species which did not 
contain the full transcription of the exon 16-19 DNA sequence (see example 1). 
These two transcripts have been shown to be derived from the separate 
duplicate copies of the KIT gene associated with the / allele. Sequencing over 
the KIT exon 17/intron 17 boundary revealed a difference in the intron 
boundary sequences present in the two duplicate copies of the KIT gene 
associated with the / allele (see example 2). The sequences are as shown 
below: 



Allele 


Kit Gene 


Exon 17 


Intron 17 


i 


KIT1 


. . • 


GTG 


f 


KIT1 


AAC 


GTG 


F 


KIT2 


AAC 


GTG 


I 


KJT1 


• • • • •^A.i^C' 


GTG 


I 


KIT2 


AAC 


ATG 



This alteration of the 5' intron splice site from a GT pair to an AT pair affects 
the splicing of the pre mRNA and results in the loss of the whole of exon 17 
from the mRNA transcribed from the 7-KIT2 sequence. This will result in a 
modified KIT protein with the associated alterations in function and phenotype. 
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Based upon the sequence polymorphism rapid tests can be developed to 
determine the alleles carried by a specific animal at the dominant white locus. 

In one form such a test would comprise amplification of the region through the 
polymerase chain reaction (PCR) utilising genomic DNA from the animal in 
question as a template. The nucleotide sequence CATG comprises a recognition 
sequence for the restriction enzyme NlaHL. This sequence is only present in the 
I-KIT 2 sequence at the junction position and thus one can differentiate the 
DNA molecules amplified from the two alleles by digestion of the amplification 
products with this enzyme or any other restriction enzyme with a suitable 
recognition site. 

Genomic DNA for use in such a test can be prepared by a wide range of 
available methods, from any tissues or products derived from the animal in 
question. A 175bp fragment of the KIT gene containing the exon 17/intron 17 
boundary region can be amplified from porcine genomic DNA using a pair of 
primers such as: 

KIT21 (5'-GTA TTC ACA GAG ACT TGG CGG C-3'); and 
KIT35 (5'-AAA CCT GCA AGG AAA ATC CTT CAC GG-3*). 

The use of such primers in a PCR-RFLP test yields a fragment of 175bp before 
digestion with Nlalll. In alleles with the G present at nucleotide position 1 of 
intron 17 (I-KIT1 type sequence) there is only one NlaUl cleavage site present 
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41bp from the end of the fragment This site is present in all versions of the 
KIT sequence. Thus digestion of the 175bp fragments obtained from i and f 
alleles yields two fragments of 134bp and 41 bp. Where the G is mutated to A 
as in the I-KIT2 sequence present in / and /* a further Main cleavage site is 
created and thus digestion yields products of 80bp and 54bp and 41bp (see 
figure 3, example 2). A number of other oligonucleotides suitable as PCR 
primers could easily be derived from the sequence of this region of the porcine 
genome. An example of such a PCR-RFLP test is given in example 3. The 
results that would be obtained from such a PCR-RFLP test described above are 
as shown below: 



Genotype 


Fragment sizes 


Fragment sizes 




134 + 41 bp 


80 + 54 + 41 bp 


III 


Yes 


Yes 


I/f 


Yes 


Yes 


in 


Yes 


Yes 


f/f 


Yes 


No 


f/i 


Yes 


No 


Hi 


Yes 


No 


1*11 


Yes 


Yes 


1*11* 


Yes 


Yes 


VI* 


Yes 


Yes 


i*/r 


Yes 


Yes 



Thus, simply by analysing restriction products certain genotypes can be 
distinguished. However, there are others which cannot be distinguished in this 
first configuration of the test. In a further refinement the test can be carried out 
in such a way that the amount of each fragment can be calculated. By carrying 
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out electrophoresis on an apparatus that allows the quantification of each of the 
bands one can determine the ratio of the two forms of template in the genomic 
l^A sample used7~Exsulples _ of such an apparatu^n^lu'de~me""Perkin~Elmer" 
Applied Biosystems 373 and 377 DNA sequencing systems- The application of 
this type of equipment is illustrated in example 4. Any equipment capable of 
determining the relative amounts of the products from the two different 
sequences is equally applicable to such tests. The expected results from such a 
test are shown below. 



Genotype 



10 



/// 
I/F 
HI* 
Hi 
F/F 
P/I* 
F/i 
I*li 
1*11* 
i/i 



Normal KIT 
sequence 
Copies 
2 
3 
2 
2 
4 
3 
3 
2 
2 
2 



Splice mutant KIT 
Copies 

2 
1 
2 
1 
0 
1 
0 
1 
2 
0 



Ratio Normal 
Splice mutant 



1 
3 
1 
2 
0 
3 
0 
2 
1 
0 



Thus, using such a test one could identify all animals carrying alleles of 
dominant white that might dispose themselves or their offspring to exhibiting 
non-white coat colour (i ovF)as those giving a ratio other than one. Depending 
on the requirement and the derivation of the lines under selection one could take 
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an appropriate subset of animals. For example in a cross derived animals 
carrying only alleles I and i one could identify any white individuals {III or Hi) 
carrying / as they would have a ratio in the test oTTTas oppos~ed~to~I for the /// 
animals. The distinction of alleles 7 and 7* can be carried out on the basis of the 
5 4bp deletion as described in the previously filed patent publication 
WO97/05278. 

There are a range of techniques by which differentiation of alleles containing 
the splice mutation and those containing the normal sequence could be 
10 differentiated by a person expert in the field. 

Analysis of the genetic composition of an animal could be based upon a number 
of different source materials. These include genomic DNA, RNA and the KIT 
protein itself. There may also be effects on the levels and nature of other 
15 proteins, metabolites and RNA species which could be measured to create a 
more indirect assay. 

DNA could be used as the basis for a number of approaches to testing. One 
approach is through the amplification of the region of DNA containing the 

20 polymorphism using the polymerase chain reaction. This could then be linked to 
a number of forms of analysis of the product. Examples of electrophoresis 
based technologies include Single Strand Conformation Polymorphism (SSCP), 
Restriction Fragment Length Polymorphism (RFLP) and DNA sequencing of 
PCR products or direct genome sequencing. Other PCR based techniques that 

25 might alternatively be used include the Perkin Elmer TaqMan systems, Single 
Nucleotide Polymorphic Extension (SNuPE) and Minisequencing. PCR 
products from the region might also have application in hybridization based 
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approaches to the differentiation of alleles at this locus. Hybridization methods 
might include the probing of Southern transfers of genomic DNA with allele 
specific oligonucleotide, RNA, DNA fragment orProteinN^ 
probes. Other strategies of application here include hybridization of genomic 
5 DNA or PCR products (specific or whole genome) to oligonucleotide arrays 
possibly in the form of 'DNA chips'. Such arrays could consist of any reagent 
capable of binding DNA or RNA derived from the locus in question in an allele 
specific manner. Further useful methods of analysis also include 
oligonucleotide ligation assay and the ligase chain reaction. For a review on 
10 methods for detecting point mutations see Landegren, 1996, Laboratory 
Protocols for Mutation detection, Oxford University press, Oxford. 

A number of effects on the RNA produced from the gene in question have 
already and may in the future be observed. All such differences between the 

15 mutated and normal forms of the gene are useful targets for the determination of 
genotype and a large range of methods are available to the person skilled in the 
art The changes that are or might be observed and methods of analysis are as 
follows. Alteration of the size, rate of processing, stability and quantity of RNA 
transcripts could be measured through widely used techniques such as northern 

20 blotting and RT-PCR as well as a number of the techniques described above for 
DNA analysis such as hybridization to oligonucleotide or DNA fragment 
arrays. 

Another approach which can be used is to use a linked genetic polymorphism 
25 which is closely associated with the presence or absence of the alteration at the 
exon/intron boundary. Such a polymorphism may occur in the KIT gene itself or 
in a chromosomal region linked to KIT. By using a single linked marker in 
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complete association with the presence/absence of the duplication or a 
combination of markers showing a partial association a highly informative test can 
be developed For instance, ~theT SSCT^~(SSigle Strand Conformation 
Polymorphism) method may be used to develop such polymorphism. The 
5 principle of the method is that double-stranded DNA, produced by PCR, is 
denatured into single-stranded DNA which is then separated by non-denaturating 
gel electrophoresis. Under non-denaturating conditions the single-stranded DNA 
forms a secondary structure due to intra-strand interaction but a proportion of the 
single-stranded DNA will renature and form double-stranded DNA. Two types of 

10 polymorphism may be revealed by this method. Firstly, a difference in nucleotide 
sequence between two alleles may influence the secondary structure of 
single-stranded DNA which is revealed as a difference in the mobility rate during 
electrophoresis. Secondly, a difference in nucleotide sequence often influences 
the mobility of the heteroduplex DNA (A heteroduplex is a double-stranded DNA 

1 5 molecule formed by two single-stranded molecules representing different alleles). 

Association between genetic markers and genes responsible for a particular trait 
can be disrupted by genetic recombination. Thus, the closer the physical distance 
between the marker and the gene in question, the less likely it is that 
20 recombination will separate them. 

It is also possible to establish linkage between specific alleles of alternative DNA 
markers and alleles of DNA markers known to be associated with a particular 
gene (e.g. the KIT gene discussed herein), which have previously been shown to 
25 be associated with a particular trait Thus, in the present situation, taking the KIT 
gene, it would be possible, at least in the short term, to select for pigs with a 
particular coat colour, indirectly, by selecting for certain alleles of a KIT gene 
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associated marker through the selection of specific alleles of alternative 
chromosome 8 markers. Examples of such markers known to be linked to the KIT 
gene on porcine c&omosom^ 8~include "genetic polymorphism inTEe KIT gene 
itself or in the closely linked genes for the a-subunit of platelet derived growth 
factor (PDGFRA) and albumin. 

Particular genetic markers associated with the KIT gene are microsatellites. These 
are simple sequence repeats of 4, 3 or, more usually, 2 nucleotides, which occur 
essentially at random around the genome at approximately every 50,000 bases 
(about 60,000 microsatellites per haploid genome). Stuttering of DNA 
polymerase during replication and unequal crossing-over during recombination 
are thought to result in the loss or gain of repeat units. This means that 
microsatellites are usually polymorphic and can have several repeat length alleles. 

Examples of linked microsatellite sequences include S0086 (Ellegren et al, 
Genomics, 16:431-439 (1993)) , S0017 (Coppieters et al, Animal Genetics 24: 
163-170 (1993)), Sw527, Swr750 and SW916 (Rhorer et al, Genetics, 136:231- 
245 (1994)) . It would be possible to select indirectly for alleles of the KIT gene 
linked to coat colour using any of the above markers, or indeed any other linked 
markers on porcine chromosome 8. 

Alterations in the level of the KIT protein could be measured using either 
specific antibodies for example in an ELISA system or on western blots or 
through the use of a range of biochemical techniques to measure the activity of 
the protein. Such tests could also be applied to other proteins or metabolites, 
the level or nature of which is altered by the presence of specific alleles at the 
locus. Different protein structures due to the presence of specific alleles could 
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be identified through the use of structure specific antibodies. As with the DNA 
and RNA based methods all these protein methods could be applied in a 
quantitativefashlra^ 

Kits could be produced for the specific analysis of the polymorphism described 
here alone and also with reagents allowing the combined analysis of the other 
polymorphisms previously reported and the subject of patent publication 
WO97/05278. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The 
examples refer to the figures in which: 

Figure 1: is a schematic representation of the structure of the known 
porcine KIT alleles, where 4bp del'n refers to the 4bp deletion in intron 
1 8 of one copy of the duplicated KIT gene DNA as reported by Moller et 
al. 1996 and Exon 17-A refers to the change of nucleotide 1 of intron 17 
from a G as in the wild type allele to an A as reported in this patent; 

Figure 2: shows an electropherogram (4% agarose Nusieve/Seakem 3:1; 
1 00V for 80 min) showing RT-PCR products of KIT exon 16-19 with the 
primers KIT1F and KIT7R. The samples 1-3 and 4-6 are Swedish Large 
White and Hampshire pigs, respectively. The size difference between 
the 424 and 301 bp fragments is due to lack of exon 17 in the latter 
fraction. The two upper bands of the Yorkshire pigs were interpreted as 
heteroduplexes (HD); 
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Figure 3: shows a 48 bp sequence comprising 21 bp of KIT exon 17 and 
27 bp of KIT intron 17 where the position of the intron/exon border is 



marked with a vertical line , the splice site mutation (ntl ) indicated 
with a vertical arrow and identical bases in alleles F and / are marked 
5 with a dot; 

Figure 4: shows the results of Maffl PCR RFLP test used to detect the 
presence of a splice site mutation in intron 17 of the KIT gene. Figure 
4A shows the position of two NlaUl recognition sites within the PCR 

10 product amplified using primer pair KTT21 and KTT35. All distances are 

given in base pairs. Figure 4B shows the size of fragments which result 
following NlaUl digestion of either normal KIT or splice mutant KIT. 
Figure 4C illustrates use of the PCR RFLP test Lane 1 shows the 
KIT21/KIT35 amplified fragment undigested. Digestion was performed 

15 on PCR products amplified from, in Lane 2: a clone which contains the 

splice site mutation; Lane 3: a clone which contains the normal splice 
site sequence; Lane 4: genomic DNA from a coloured pig; Lane 5: 
genomic DNA from a white pig. Fragment sizes are given in base pairs; 

20 Figure 5: shows a comparison of the ratio of normal to splice mutant 

KIT in animals of genotypes I/I, I/i and I/F\ 

Figure 6: shows the ratio values for 56 Landrace and 33 Large White 
animals. A clearly bimodal distribution is observed with 7 Landrace and 
25 3 Large White individuals having a ratio value of approximately 3 or 

above, suggesting them to be heterozygous carriers for the f allele 
(genotype I/f). This means f has gene frequency estimates of 6.25% 
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(7/1 12 chromosomes tested) and 4.5% (3/66 chromosomes tested) within 
the Landrace and Large White breeds respectively; and 



FIGURE 7: shows a plot of Ct FAM versus Ct TET for animals of 
genotypes Ii and // analysed for KIT splice mutant genotype using 
TaqMan® chemistry. 



Example 1 

10 

RT-PCR of porcine CT^exon 16-19 



i. mRNA purification from blood samples 

Fresh blood samples were collected in citrate tubes from coloured Hampshire 
15 pigs and Large White pigs. Leukocytes were isolated from 5 ml blood using 
Ficoll 100 (Pharmacia Biotech). Isolation of mRNA from leukocytes was then 
carried out using the Quickprep Micro mRNA purification kit (Pharmacia 
Biotech). The mRNA was stored as a precipitate under ethanol at -70°C for up 
to one month before use in reverse transcriptase (RT)-PCR. 

20 

ii. RT-PCR of KIT exon 16-19 

First-strand cDNA synthesis was accomplished using the First-Strand cDNA 
Synthesis kit (Pharmacia Biotech) so that -100 ng mRNA was randomly 
primed by 0.1 p-g pd(N6) in a total volume of 15 pi. Two ul of the completed 
25 first cDNA strand reaction was then directly used per 12 ul PCR reaction by 
adding 10 ul PCR mix containing 10 pmol each of the mouse/human derived 
primers KIT1F and KIT7R (5'-TCR TAC ATA GAA AGA GAY GTG ACT C 
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and 5'-AGC CTT CCT TGA TCA TCT TGT AG, respectively; Moller et aL 
1996, supra), 1.2 ^il 10 x PCR-buffer (10 mM Tris-HCl, pH 8.3, 50 mM KC1) 
and 0.5HD of AmpliTaq polymerase (Perkin-Elmer) incubated with an equaT 
amount Taqstart antibody (Clontech) at 25 °C for 5 min to achieve a hot start 
5 PCR. The reaction was covered with 20 fj.1 mineral oil and thermocycled in a 
Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 min, 55-48 
°C (touchdown one degree per cycle the first seven cycles and then 48°C in the 
remaining cycles) for 1 min and 72°C for 1 min. After PCR 2|xl loading dye 
was added to each sample which were then loaded on 4% agarose gel 
10 (Nusieve/Seakem 3:1, FMC Bioproducts) and electrophoresed with 100V for 80 
min. Products were visualised by ethidium bromide staining and UV- 
illumination. 

iii.Cloning and sequencing of RT-PCR-products 

15 The RT-PCR products representing KIT exon 16-19 were purified by extraction 
from 2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and 
cloned into the pUC18 vector using the Sureclone ligation kit (Pharmacia 
Biotech). Plasmids were isolated using the QIAFilter plasmid Midi kit 
(QIAGEN). Cloned plasmid inserts were sequenced using dye primer 

20 chemistry. Each cycling reaction was prepared with plasmid template DNA and 
ready reaction mix containing fluorescently labelled M13 forward or reverse 
primer as described in the ABI Prism protocol P/N 402113 (Perkin Elmer). 
Cycling and sample pooling were performed using a Catalyst 800 Molecular 
Biology Workstation (ABI) following the instruments user manual (Document 

25 number 903877, Perkin Elmer). The resulting extension products were purified, 
loaded and analysed using the 377 ABI Prism sequencer as described by the 
instrument protocol P/N 402078 (Perkin Elmer). 
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iv Results and discussion 

A 424 bp fragment including KIT cDNA exon 16-19 was amplified from all 
pigs. The Hampshire pigs did not show any additional products whereas the 
5 Large White pigs (eight tested) all showed a 301 bp truncated cDNA fragment 
(Fig 2). Sequence analysis revealed the 424 bp fragment was identical in the 
two breeds whereas the whole exon 17 (123 bp) was missing from the 301 bp 
fragment. Apparent differences between individuals regarding the relative 
amounts of these two products may have been caused either by different 
10 genotypes containing differing numbers of copies of the KIT gene sequence, 
individual differences in mRNA expression levels or random RT-PCR effects. 

The two upper fragments present in Large white pigs represent heteroduplexes 
between the 301 and 424 bp fragments (Fig. 2). This was shown by an 
15 experiment where these slow migrating fragments were generated by pooling 
homoduplexes of the 424 and 301 bp which were then heat denatured and 
cooled to 25°C. Moreover, cloning of the lower heteroduplex fraction of a Large 
White pig resulted in clones with insert length corresponding to either of the 
two homoduplexes. 

20 

Example 2 

PCR Amplification and Sequencing of KTT Exon 17-Intron 1 7 (5' Splice Site) 

i. PCR to produce DNA Sequencing Template 
25 A 175 bp region including the boundary between exonl7 and intronl7 of the 
KIT gene was amplified for sequence analysis using forward primer KIT21 (5' - 
GTA TTC ACA GAG ACT TGG CGG C -3') and reverse primer KIT35 (5' - 
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AAA CCT GCA AGG AAA ATC CTT CAC GG - 3'). PCR was carried out on 
a DNA thermal cycler (Perkin Elmer 9600) in a total volume of 20 ul 
containing 25 ng genomic DNA, 1.0 mM MgCl^ 50 mM KC1, 10 rnM Tris- 
HC1, pH 8.3, 200 uM dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 
5 pmol of both KIT21 and KIT35 primer. To activate AmpliTaq Gold, initial heat 
denaturation was carried out at 94°C for 10 minutes followed by 32 cycles each 
consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final 
extension lasted for 7 min at 72°C. PCR products were cloned into vector 
pUC 18 using the SureClone ligation kit (Pharmacia Biotech). 

10 

ii. Preparation Of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar 
plasmid midi kit (Genomed) and the resulting DNA diluted to 1 50 ng/ul. 

i5 iii.Sequencing of plasmid DNA 

DNA was sequenced as in example 1 section iii. 

iv.Results 

A portion of the DNA sequence from exon 17 and intron 17 of the KIT gene 
20 was determined and compared between animals with each of these three alleles. 
Figure 3 shows that the / allele carries a splice site mutation at position 1 of 
intron 17. This G to A base substitution is present in one of the two gene copies 
carried on each chromosome. The base substitution occurs in the invariant GT 
dinucleotide which characterises 5' exon/intron boundaries. Analysis of the f 
25 allele showed the splice site mutation was not present in either the normal 
(KIT1) or duplicated copy of the gene (KIT2). We have found the splice site 
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mutation is unique to the / alleles, and therefore makes it possible to distinguish 
the I-KTT2 sequences. 



Example 3 

5 

Testing For the Presence of the Splice Site Mu tation with PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease Main (CATG) was used to exploit the point substitution 
10 identified at position 1 of intron 17 (Figure 3). The Main recognition sites in 
the fragment amplified from KIT and the expected restriction products are 
illustrated in Figure 4A and 4B respectively, 

15 

i. DNA preparation for RFLP Test 

DNA can be prepared from any source of tissue containing cell nuclei, for 
example white blood cells, hair follicles, ear notches and muscle. The 
procedure here relates to blood cell preparations; other tissues can be processed 
20 similarly by directly suspending material in K buffer and then proceeding from 
the same stage of the blood procedure. The method outlined here produces a 
cell lysate containing crude DNA which is suitable for PCR amplification. 
However, any method for preparing purified, or crude, DNA should be equally 
effective. 

25 

Blood was collected in 50mM EDTA pH 8.6 to prevent coagulation. 50ul of 
blood was dispensed into a small microcentrifuge tube (0.5ml Eppendorf or 
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equivalent). 450pl of TE buffer was added to lyse the red blood cells (haem 
groups inhibit PCR) and the mix vortexed for 2 seconds. The intact white and 
residual red blood cells were then centrifugedTor 12 seconds at I370OO g inaT 
microcentrifuge. The supernatant was removed by gentle aspiration using a low 
5 pressure vacuum pump system. A further 450|il of TE buffer was then added to 
lyse the remaining red blood cells and the white blood cells collected by 
centrifugation as before. If any redness remained in the pellet, this process was 
repeated until the pellet was white. After removal of the last drop of 
supernatant from the pelleted white blood cells, 100|ul of K buffer containing 
10 proteinase K was added and the mixture incubated at 55 degrees C for 2 hours. 
The mixture was then heated to 95-100 degrees C for 8 minutes and the DNA 
lysates stored at -20°C until needed. 

Reagents 

15 T.E. Buffer: lOmM TRIS-HC1 pH8.0 

ImM EDTA 

K Buffer 50mM KC1 

lOmM TRIS-HC1 pH8.3 
20 2.5mM MgC12 

0.5%Tween20 

ii. Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
25 position 1 of intron 17, digestion reactions were set up as below: 

3.0 nl PCR amplified DNA 
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1.0 nl 10XNEBuffer4 
0.1 ill BSA 100 ng/ml 

o.ini.Mani.io.u/|Jii , _ 

5.8 Ml dH20 

(IX NEBuffer 4 (New England Biolabs) contains 50 mM potassium acetate, 20 
mM Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following 
incubation at 37°C for 90 minutes each 10 ul reaction volume had 2 ul of 
loading dye added and the mix loaded on a 8% native polyacrylamide gel 
(Protogel, 37.5:1 acrylamiderbisaciylamide , National Diagnostics, Atlanta) in 
0.5 X TBE (44 5 mM Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and 
electrophoresed for 3 hours at 200V in a vertical slab unit (SE600 Hoefer 
Scientific Instruments). Products were visualised by ethidium bromide staining. 

iii Results 

A PCR RFLP protocol was designed to test for the presence of the splice site 
mutation as the substitution occurs within the recognition site for restriction 
endonuclease NlaUl. Figure 4B illustrates that presence of the G to A base 
substitution at position 1 of KIT intron 17 results in restriction at each of two 
MfllH recognition sites within the 175 bp DNA fragment. Following 
electrophoresis, this results in fragments of sizes 80 bp, 54 bp and 41 bp. Where 
the splice site mutation is absent however, incubation with Mam results in 
digestion only at recognition site 1. Following electrophoresis this results in 
fragments of 134 bp and 41 bp. The invariant Malll recognition site 1 serves as 
an internal control to ensure complete digestion has taken place. Results of this 
PCR RFLP analysis are illustrated in Figure 4C. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation 
(lane 2) or carry the normal splice site sequence (lane 3). Lane 4 shows the 
result of analysis where DNA amplified from the genomic DNA of a coloured 
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animal was used. Lane 5 shows the resulting bands where a white animal was 
tested. The test was used to analyse 121 individuals from seven different breeds 
of pig. The splice site mutation was tbund only in the 97 animals with the 
dominant white phenotype (//- or I*/i) and none of the 24 coloured (f or f) 
5 examples (Table 2). This analysis confirms I and /* to be unique in that they are 
the only alleles to carry the splice site mutation. 

TABLE 2 

Distribution of the Splice Site Mutation Between Different Breeds and Coat 
10 Phenotype 



Breed 


Coat 


Assumed 


Animals 


Normally 


Splice 




Colour 


Genotype 1 


Tested 


spliced KIT 2 


Mutation 2 


Large White 


White 


V- 


33 


33 


33 


Landrace 


White 


V- 


56 


56 


56 


Hampshire 


Coloured 


i/i 


5 


5 


0 


Duroc 


Coloured 


i/i 


5 


5 


0 


Pietrain 


Coloured 


i/i 


8 


8 


0 


Meishan 


Coloured 


i/i 


5 


5 


0 


Wild Boar 


Coloured 


i/i 


1 


1 


0 


Wild Boar 


White 


I*/- 


8 


8 


8 


x Large 












White 












Totals 














White 


I/- 


89 


89 


89 




White 


I*/- 


8 


8 


8 




Coloured 


i/i 


24 


24 


0 



White animals may be homozygous or heterozygous for the / allele 
2 Presence of the splice site mutation determined by Nlalll PCR RFLP test 



15 
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Example 4 

Quantification of Norm al KIT and S plice Mutant K IT rtntron 17 ntl G ^± 

As the splice site mutation is present in only one of the duplicated regions of I 
and not in the duplicated region of/, the various genotypes can be expected to 
have the attributes described in Table 3. 
TABLE 3 



Genotype 


Copies 


of Normal Copies of KIT 


Ratio of normal KIT 




KIT 


containing the splice 


to splice mutant KIT 






mutation 




I/I 




2 2 


1:1 


I/i 




2 1 


2:1 


i/i 




2 0 


2:0 


I/I p 




3 1 


3:1 


I p /i 




3 0 


3:0 



Due to the dominance of allele /, three of the genotypes in Table 2 are carried 
by white animals and therefore can not be identified by phenotypic 
characterisation. Quantification of the relative amounts of the normal KIT gene 
and the splice mutant KIT gene allows the ratio between the two to be 
calculated, and therefore the genotype of individual animals predicted. This was 
achieved by quantification of two DNA fragments following N!dm digestion. 
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The amount of 134 bp fragment, representative of the normally spliced KIT 
gene, and of 54 bp fragment, representative of the splice mutant KIT, were 
measuredToIlowing electrophoresis using UeneScan software. 

5 i. PCR to Produce DNA for Quantification 

As described in example 2 section i. The reverse primer KIT35 is labelled with 
the ABI fluorescent dye FAM at the 5' end. 

ii Restriction Enzyme Digestion 

10 As described in example 2 section ii. 

iii Electrophoresis and Quantification of DNA Fragments 

Following digestion, 0.5 pi °f the reaction volume was mixed with 2.5 jjlI of 
deionised formamide, 0.5 fj.1 of GS350 DNA standard (ABI) and 0.4 \i\ blue 

15 dextran solution before being heated to 90°C for 2 minutes and rapidly cooled 
on ice. Three jal of this mix was then loaded onto a 377 ABI Prism sequencer 
and the DNA fragments separated on a 6% polyacrylamide gel in 1 X TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The peak area of fragments 
representative to both the normal and splice mutant forms of KIT were 

20 quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice 
the peak area value of the 54 bp fragment (splice mutant KIT) in order to 
25 calculate the ratio value for each sample. 
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v. Results 

Analysis was performed on animals from the Swedish wild pig/Large White 
_ " intercross pedigree for which genotypes at / have been determined by 
conventional breeding experiments with linked markers. Figure 5 and Table 4 
5 show the ratio of normal to mutant KIT calculated for animals from each of the 
three genotype classes, I/I (expected ratio 1:1), I/i (expected ratio 2:1) and Iff 
(expected ratio 3:1). The results are entirely consistent with the expected ratio 
values and indicate that the three genotype classes can be distinguished using 
this method. 

10 

TABLE 4 

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a 
Wild Pig/Large White Intercross 

15 



Genotype 


Phenotype 


Expected 

Ratio 
(Normal: 
Mutant) 


Observed Ratio 
(Normal:Mutan 

t) 
±SE 


Number 
Tested 


I/I 


White 


1:1 


1.15 ±0.075 


13 


I/I p 


White 


3:1 


3.11 ±0.084 


12 


I/i 


White 


2:1 


2.23 ±0.109 


14 



Figure 5 illustrates that the range of ratio values calculated for the two 
genotypes I/I and I/f do not overlap. This enables animals carrying the / allele 
to be identified and the frequency of the allele within different pig breeds 
20 determined. Ratio values were calculated for 56 Landrace and 33 Large White 
animals and the results are shown in Figure 6. A clearly bimodal distribution is 



SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 



28 



PCT/GB98/03081 

i 



observed with 7 Landrace and 3 Large White individuals having a ratio value of 
approximately 3 or above, suggesting them to be heterozygous carriers for the 
^T'' allele (genotype Z^l This means f has gene frequency estimates of 6.23% 
(7/1 12 chromosomes tested) and 4.5% (3/66 chromosomes tested) within the 
Landrace and Large White breeds respectively. 

Example 5 

Analysis for presence and quantification of the porcine KIT splice mutation 
using the PE ABI TaqMan chemistry 

Method 

L Preparation of template DNA for PGR 
DNA was prepared as in example 3, section i 
iL TaqMan® PCR reactions 

TaqMan® PCR reactions were set up as shown in table 5 
TABLES 



P CR mix for TaqMan based splice mutation test 



Reagent 


Final Cone 2 


Volume 


lOx TaqMan® Buffer A (Perkin Elmer) 


lx 


2.50 ul 


25mM MgCl 2 Sol B 


5mM 


S.OOfil 


DATP 


200uM 


0.50 ul 


DCTP 


200uM 


0.50 ul 


DGTP 


200uM 


0.50 ul 


DUTP 


200uM 


0.50 


Amplitaq Gold™ (5U/ul) (Perkin Elmer) 


0.05U/ul 


0.25 ul 


AmpErase™N-Glycosylase (lU/ul) (Perkin 


O.OlU/ul 


0.25 ul 


Elmer) 






KITTM -NEST-F (5uM) 


500nM 


2.50 ul 


KITTM-NEST-R (5uM) 


500nM 


2.50 ul 


KITTM FAM(5uM) 


lOOnM 


0.50 ul 


KITTM TET (5uM) 


lOOnM 


0.50 ill 
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25% Glycerol 8% 8.00*11 

Porcine genomic DNA LQQjd 

25.00 

- ' ' ' til 

The PCR primers used were as described below: 

KITTM-Nest-F (5'-CTC CTT ACT CAT GGT CGA ATC ACA-3 ') and 
KITTM -Nest-R (5'-CGG CTA AAA TGC ATG GTA TGG-3 '). 
The TaqMan® probes used were: 

KITTM-A FAM (5'-TCA AAG GAA ACA TGA GTA CCC ACG CTC- 
3') and 

KITTM -G TET (5*- TCA AAG GAA ACG TGA GTA CCC ACG C -3') 

The TaqMan® probes were prepared by Perkin Elmer and labelled with FAM 
and TET as indicated as well as the standard quenching group TAMRA. The 
lOx TaqMan® Buffer A, Amplitaq Gold™, AmpErase N-Glycosylase, NTP's 
and 25mM MgCl 2 used were part of the TaqMan® PCR Core reagent Kit, 
supplied by Perkin-Elmer. 

The reactions were then placed into a Perkin Elmer ABI Prism 7700 Sequence 
Detector and the reaction carried out using the following thermal profile, 50°C 
for 2 minutes, 95°C for 10 minutes followed by 40 cycles of 95°C 15s, 62°C 
60s. The reactions were carried out under the control of 'Sequence Detector 
V.1.6' software using the 'Single Reporter* and Real-Time' options with the 
'Spectral Compensation' function activated. Upon completion of the run real- 
time profiles for each sample were examined on the ABI7700 to check for any 
samples giving highly irregular profiles which were then excluded. The 
thresholds for both dyes, Fam and Tet, were set so that they intercepted each 
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dye during the exponential phase of PCR. Following updating of the 
calculations in 'Sequence Detector V.1.6' software results were exportated into 
MS ExcelToi^fiirffier analysis. ~ 

iii. Analysis of results 

Based upon the underlying theoretical principle that one cycle of PCR gives a 
doubling in the amount of cleavage of the quenching dye from the allel 
specific probe and therefore doubles the signal one would expect the threshold 
cycle numbers from the II and Ii genotypes analysed to be as below: 



Table 6: 

Theoretical results for TaqMan® analysis of genotype at the KIT splice 
mutation 



Genotype 


Copies KIT 1 


Copies KIT 2 


Theoretical Ct 


Theoretical Ct 




(G) 


(A) 


TET(G) . 


F AM (A) 


n 


2 


2 


X 


Y 


Ii 


2 


1 


X 


Y+l 



In theory the Ct for TET and FAM signals, represented as X and Y should be 
the same, as equal numbers of copies of the target sequences should be present 
in an 77 animal. However in practice this does not necessarily occur due to 
differences in the hybridization and cleavage efficiency of the two probes and 
variation in the setting of the threshold cycle between the two dye signals. The 
reduction in splice mutant containing (A) sequences relative to those not 
containing the splice mutation (G) in the Ii animals ie 2:1 G:A ratio rather than 
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1:1 as for /7 genotype, should lead to the FAM signal reaching the threshold 1 
cycle later than the TET signal in the genotype Ii animals. The actual results for 
samples tested are shown in Table 7. 

Table 7 

Ct values from analysis of II and Ii genotypes 

Sample Genotype Ct FAM (A) CtTET Ct F AM - Ct TET 



(G) 


1 


Ii 


24.68 


22.59 


2.09 


2 


Ii 


25.98 


23.62 


2.36 


3 


Ii 


26.54 


25.57 


0.97 


4 


Ii 


27.37 


24.78 


2.59 


5 


Ii 


24.94 


21.61 


3.33 


6 


Ii 


25.68 


22.1 


3.58 
Ii Mean = 2.49 


7 


II 


22.05 


23.78 


-1.73 


8 


II 


24.22 


24.59 


-0.37 


9 


n 


24.19 


23.85 


0.34 


10 


n 


23.66 


23.51 


0.15 


11 


II 


24.35 


22.71 


1.64 


12 


n 


22.82 


21.69 


1.13 


13 


n 


22.84 


22.7 


0.14 


14 


n 


23.17 


22.9 


0.27 
Mean = 0.20 
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No 35 35 0 

Template 

No ~ 35 35 0 

Template 

No 35 35 0 

Template 

No 35 35 0 

Template 



Despite variation around the mean values it can be seen from Table 7 that there 
is a significantly increased delay in the FAM signal reaching the threshold level 
(approximately 2 cycles) relative to the TET signal in Ii animals compared to II 
animals as predicted, reflecting the reduced number of copies of the splice 
mutant (A) sequence present in animals of the Ii genotype. Plotting of the 
individual samples on a scatter plot (Figure 7) shows clustering of the two 
genotypes with the Ii cluster shifted along the Ct FAM axis due to the reduced 
number of copies of the KIT2 (A) sequence for which the FAM probe is 
specific. 
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CLAIMS: 

1. A method for detennini^^oafcdlowr genotype in a^ig which comprises: 

(a) obtaining a sample of pig nucleic acid; and 

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation 
is/is not present at one or more exon/intron splice sites of the KIT gene. 

2. A method as claimed in claim 1 wherein the analysis in step (b) is carried 
out to determine whether a mutation is/is not present at the exon 17/intron 17 
boundary. 

3. A method as claimed in claim 2 wherein the mutation consists of the 
substitution of the G of the conserved GT pair for A. 

4. A method as claimed in any one of claims 1 to 3 wherein the sample of 
nucleic acid is amplified prior to analysis. 

5. A method as claimed in claim 4 wherein the nucleic acid is genomic 
DNA. 

6. A method as claimed in claim 5 wherein amplification is carried out 
using PGR and at least one pair of suitable primers. 

7. A method as claimed in claim 6 wherein the pair of suitable primers is: 



SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 



34 



PCT/GB98/03081 



5'-GTA TTC ACA GAG ACT TGG CGG C-3 '); and 
5 -AAA CCT GCA AGG AAA ATC CTT CAC GG-3\ 

8. A method as claimed in any one of claims 5 to 7 wherein after 
amplification the nucleic acid is treated with a restriction enzyme, followed by 
analysis of fragment lengths. 

9. A method as claimed in claim 8 wherein the nucleic acid is treated with 
the restriction enzyme NlaTH. 

10. A method as claimed in claim 8 or claim 9 wherein the ratio of 
restriction fragment lengths is determined. 

11. A method as claimed in claim 4 wherein the nucleic acid is mRNA. 

12. A method as claimed in claim 1 1 wherein the nucleic acid is amplified 
using RT-PCR. 

13. A method as claimed in claim 12 wherein the length of RT-PCR product 
is determined. 

14. A method for determining coat colour genotype in a pig which comprises the 
step of analysing a sample of pig KIT protein to determine whether the protein is the 
splice variant protein. 
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15. A kit for use in determining the coat colour genotype of a pig which 
comprises one or more reagents suitable for determining whether a mutation is 
present at one or more exon/intron splice sites of the KIT gene. 

5 

16. A kit as claimed in claim 1 5 which comprises one or more reagents for 
carrying out PCR and one or more pairs of suitable primers. 

17. A kit as claimed in claim 16 which comprises the following pair of 
10 primers: 

5'-GTA TTC ACA GAG ACT TGG CGG C-3'); and 
5 '-AAA CCT GCA AGG AAA ATC CTT CAC GG-3'. 
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Sequence Alignment Across the Exon /Intron Border of KIT Exon 17 

Allele Gene Copy Sequence 

Exon 17 Intron 17 

I KIT1 AAT TAC GTG GTC AAA GGA AAC I GTG AGT ACC CAC GCT CTC CTG ACA GTC 
KIT2 |A. 

I p KIT1 ...1G... 

KIT1 IG - 



x 



KIT1 IG. 



FIG.3 



SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 



PCT/GB98/03081 



1/8 



X 



CL 

JO 



o 
oo 



CO 



CO (L> 

Z 



CL 



CL. 
X 



OO 



c 

O 

OX) 



z 



A 0) 

Z£ 



cm 
5 



o 







ntKIT 




Norma 


lice Muta 


o 






LL 







SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 



PCT/GB98/03081 



6/8 



Ratio of normal to splice mutant KIT 
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Ratios for Landrace and Large White 
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Plot of Ct FAM vs Ct TET for TaaMan based PCR 
analysis of porcine KI T splice mutation genotype 
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